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Abstract 

Membrane proteins move in heterogeneous environments with spatially (sometimes temporally) varying 
friction and with biochemical interactions with various partners. It is important to reliably distinguish 
different modes of motion to improve our knowledge of the membrane architecture and to understand the 
nature of interactions between membrane proteins and their environments. Here, we present an analy¬ 
sis technique for single molecule tracking (SMT) trajectories that can determine the preferred model of 
motion that best matches observed trajectories. The method is based on Bayesian inference to calculate 
the posteriori probability of an observed trajectory according to a certain model. Information theory 
criteria, such as the Bayesian information criterion (BIC), the Akaike information criterion (AIC), and 
modified AIC (AICc), are used to select the preferred model. The considered group of models includes 
free Brownian motion, and confined motion in 2nd or 4th order potentials. We determine the best infor¬ 
mation criteria for classifying trajectories. We tested its limits through simulations matching large sets of 
experimental conditions and we built a decision tree. This decision tree first uses the BIC to distinguish 
between free Brownian motion and confined motion. In a second step, it classifies the confining potential 
further using the AIC. We apply the method to experimental Clostridium Perfingens e-toxin (CPeT) 
receptor trajectories to show that these receptors are confined by a spring-like potential. An adaptation 
of this technique was applied on a sliding window in the temporal dimension along the trajectory. We 
applied this adaptation to experimental CPeT trajectories that lose confinement due to disaggregation of 
confining domains. This new technique adds another dimension to the discussion of SMT data. The mode 
of motion of a receptor might hold more biologically relevant information than the diffusion coefficient 
or domain size and may be a better tool to classify and compare different SMT experiments. 


Introduction 

Advances in single molecule tracking (SMT) techniques, have made it possible to record trajectories of 
individual biomolecules in a large variety of biological systems mm- This allows for new insight into the 
dynamics of membrane proteins and into the structural organization of the membrane. Labeled membrane 
biomolecules can undergo free Brownian diffusion, confined motion, hopping, stabilization by scaffolding 
proteins, anomalous diffusion etc. The complex motion of membrane proteins has been attributed to 
molecular crowding effects mm, intermolecular interactions mm, differences in lipid solubility m , cy- 
toskeleton barriers 013, non-local potential fields induced by the environment HMH, tethering to the 
cytoskeleton [laiTi], lipid rafts or domains [T5l[T6] and hopping between confinement areas [7]. Finally, 
proteins often exhibit a mix between these behaviors that lead to different modes of motion (Fig. [T]top). 

It is important to reliably distinguish between different modes of motion of molecules and to quantify 
their characteristics. This allows to gain deeper insights into the structure of the membrane and to better 
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understand the nature of the interactions between proteins and their environments. The most widely 
spread approach to classify the mode of motion is based on the analysis of the mean-square displacement 
(MSD) of the tracked molecule [T71[T1]. The MSD is usually plotted against the time lag r. In the case 
of Brownian motion, the resulting points should lie on a line, whose slope is proportional to the diffusion 
coefficient D (for 2 dimensional Brownian diffusion MSD(t) = 4I?T“,Qf = 1). If the relationship is 
not linear, the motion of the molecule is classified as subdiffusive (a < 1) or superdiffusive (a > I). 
In the case of confined motion, the particle does not escape from a corral of a certain size during the 
observed time, which will manifest itself in the MSD versus time lag plot through a plateau. Yet, this 
method is known to fail to take into account diffusion heterogeneities and transient confinement may be 
misinterpreted as anomalous diffusion. Hence, it tends to often identify biomolecule motion as subdif¬ 
fusive and leads to extremely wide distributions of diffusion coefficients that are difficult to associate to 
physical characteristics of a medium. A different approach is to analyze the cumulative distribution of 
square displacements for a fixed lag time. Here, the cumulative distribution is analyzed for individual 
trajectories m or multiple trajectories Similarly, this method is sensitive to heterogeneities and 

does not include interactions with the local environment. Image correlation techniques are also used |22) . 
as well as a technique based in Bayesian inference |23| . Other methods exploit higher-order moments of 
the biomolecule displacement [24], first passage times [25] and the analysis of the radial density distri¬ 
bution [14j . These estimators also tend to introduce the heterogeneities into the quantity describing the 
motion or are sensitive to local geometrical effects. Finally, all of these methods exploit only a subset 
of the available information because they either discard part of the full information or loose information 
through averaging. 

A related challenge is to correctly subdivide trajectories that undergo different modes of motion and 
to clearly determine when the mode of motion changed [9l|26]. Receptors have been observed to be 
transiently confined in small corrals and to then continue with free Brownian diffusion |19j . One method 
to detect transient confinement is to evaluate the probability that the tracked molecule will stay within 
a certain region during a short window that is moving along the entire trajectory [271128] . Yet, this 
method fails when the environment is highly heterogeneous or when local interactions are sufficiently 
strong to significantly deviate the trajectories from pure Brownian motion. A more versatile method 
evaluates the diffusion coefficient, the MSD curvature, and trajectory asymmetry with a threshold for a 
variable window. It has been shown to segment trajectories into portions exhibiting stalled, constrained, 
directed or Brownian motion [29] . Different types of motion can also be identified by a characteristic 
feature vector [30] . Features that can distinguish between directed motion, fast and slow drifting motion, 
and confined motion are: net displacement, straightness, bending, efficiency, asymmetry and skewness. 
Another wavelet transform based method can reliably detect dynamic heterogeneities in time series data 
without making prior assumptions about a model, but it does not give information on what model best 
describes the data m- 

In an attempt to address the high heterogeneity of the membrane, to take the local and non-local 
interactions into account and to exploit as much information as possible from SMTs, we previously devel¬ 
oped a Bayesian inference scheme allowing the spatial mapping of both the diffusivity and the potential 
field [711101112] . Here, we extend the Bayesian scheme to classify the nature of protein motion in the mem¬ 
brane by building a solid tool for discriminating between modes of motion. The studied cases are free 
Brownian motion, the harmonic confined trajectories V (x and the anharmonic confined trajectories 
y oc r^. These 3 simple models cover a large set of possible membrane molecule behaviors. Confinement 
can stem from a large variety of interactions, ranging from purely local interactions, for example with 
the cytoskeleton, to highly non local interactions, for example complex organization of different lipids. 
The higher the anharmonicity of the potential, the more it can model localized interactions. The here 
presented classification technique is an extension of the inference technique that can provide a compre- 
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hensive measure of the fit of a used model. The previously introduced inference technique only returns 
the most likely values for the model parameters, but it does not provide a relative value for the goodness 
of fit of the chosen model over alternative models. This quantitative model comparison is an additional 
layer of analysis that is added after the previously introduced inference method is used for each model 
separately. 

To classify the motion, we infer the parameters that best fit an observed trajectory, assuming a model 
for the motion and obtain the maximum a posteriori (MAP). Using the MAP, we can calculate various 
information based criteria, such as the Bayesian information criterion (BIG) and the Akaike information 
criterion (AIC) for each tested model and use them to determine the preferred model (Figure [T] middle). 
These criteria are often better suited for model selection than the direct likelihood-ratio test, because 
they take the degrees of freedom of the model into account. Moreover, to test the accuracy and to further 
enhance the preferred model selection, we built a decision tree using simulated data for all models (Fig¬ 
ure [T] bottom). We then developed a variant of the technique that performs the model selection with a 
temporal resolution. This provides a tool that can detect changes in the mode of motion of a biomolecule. 

Extensive simulations with numerical trajectories undergoing Brownian motion in confining poten¬ 
tials are used to build a decision tree. The decision tree first determines if a trajectory is undergoing free 
Brownian motion or if it is confined by a potential through the Bayesian information criterion. If the 
trajectory is classified as being confined by a potential, the decision tree can further distinguish between 
potentials of the type V oc and V oc r'^. Note that there are no theoretical limitations imposing the 
analysis to specifically stop at order 4 for the potential. Yet, often experimental recordings are limited to 
few hundred points, hence the inference for higher order potentials would not gather significantly more 
information. Then, we apply this technique to experimental trajectories of the Clostridium Perfingens 
e-toxin (CPeT) receptor on live Madin-Darby Canine Kidney (MDCK) cell membranes and confirm that 
these receptors are confined in a harmonic-like potential, as previously shown m- Furthermore, we 
apply the temporal version of the model selection to experimental CPeT receptor trajectories that lose 
confinement due to disaggregation of confining domains. The CPeT is a member of the pore-forming 
toxin family. It is secreted by the bacterium as prototoxin monomers and activated by an enzymatic 
reaction. An individual toxin binds to a currently unidentified 37-kDa membrane receptor on the cell 
membrane, oligomerizes with other bound toxins, and forms pores that pierce the cell membrane with a 
/3-barrel and cause the death of the cell by uncontrolled ion exchange [32l[33] . 


Results 

Information theory criteria 

To determine the preferred model corresponding to a biomolecule trajectory, we calculate the Bayesian 
information criterion (BIC), the Akaike information criterion (AIC), and a modified version of the AIC 
whose performance improves for short trajectories, the corrected AIC (AICc). These criteria were devel¬ 
oped to determine the preferred model, or the model that best describes a given data set. Note that to 
build the decision tree, trajectories were generated from the 3 predefined models. Experimentally, tra¬ 
jectories will be associated to one of the 3 models which describe a large set of experimental situations, 
yet it is obviously possible to extend the decision tree by adding other types of motion. 

The AIC is based on Kullback-Leibler information loss [33] and can be understood as information 
entropy. The AIC is given by 
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AIC = -2 • ln(L) + 2k 


( 1 ) 


where k is the number of free parameters in the model and L is the MAP of the likelihood function 
assuming a certain model. Note that in our case the number of parameters greatly varies between models 
with 1 variable for the Brownian motion, 6 variables for harmonic confinement and 16 variables for 
anharmonic 4th order confinement. The preferred model is the one that yields the lowest AIC value, 
which is a measure of lost information. The AICc is a variant of the AIC with a correction for finite 
sample sizes N and is given by 


AICc = AIC + 


2k{k + 1) 
N-k-1 


( 2 ) 


The BIC, also known as the Schwarz criterion, is based on Bayes factors and is derived from the 
asymptotic behavior of Bayes estimators under a special class of priors |35j . The preferred model is the 
model that yields the minimal value for: 


BIC = -2 • ln(L) + k ■ \n{N) 


( 3 ) 


These three information criteria are calculated for each individual trajectory and then a decision 
about the preferred model is made according to each criterion. Such an approach is more adapted to 
single-molecule data. Averaging values of the information criteria over many trajectories will lead to 
overlapping distributions and to the inability to decide between different models, as shown in figure SI. 
Furthermore, the high heterogeneity of biological media prevent the use of averaging between different 
spatial parts of the cell. 

We will next evaluate how many numerical trajectories are correctly attributed to their respective 
mode of motion under various conditions matching the ones met experimentally. This will lead to a 
multidimensional map that will give us the experimental parameter range for which this technique is 
valid. Furthermore, we will use these results to build the decision tree. Here, the studied cases are free 
Brownian motion, the harmonic confined trajectories V (x and the anharmonic confined trajectories 
V oc r^. Specihcally, we use the 2nd order potential V{x,y) = l/2kxx^ + l/2fcy?/^ with k = ^k\ + ky 
and the 4th order potential V{x,y) = axx'^ + ayy'^ with a = yj -|- ay. These 3 simple models cover 
a large set of possible membrane molecule behaviors. We will not discuss diffusion heterogeneities, since 
they were treated in previous work [zliiiliig. Indeed, free Brownian motion in heterogeneous diffusive 
environments can be modeled and inferred as free Brownian motion with a locally varying diffusion co¬ 
efficient D{r). 


Free Brownian motion 

Here, we determine how well the three information theory criteria attribute the correct model to nu¬ 
merical free Brownian trajectories. We studied the performance over a range of trajectory lengths (N), 
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diffusion coefficients (-D), and acquisition times {tacq) matching most of the experimental conditions and 
most of biological media properties. Experimentally, the tacq is the time over which the camera integrates 
the arriving photons plus the readout time and images are acquired back to back. In simulations tacq is 
the time between two simulated points. Under each condition, which is specified in the figure caption, 
we simulated 300 trajectories and recorded the normalized histogram of decisions, which are made after 
each trajectory by determining which model yielded the minimal value for a criterion. The performance 
of the BIG, AIC and AICc with respect to the trajectory length is shown in figure [2] (A). The BIG clearly 
outperforms the AIG and the AIGc, even for very short trajectories. Only 20 points are sufficient for the 
BIG to correctly find the model of motion, while a length of 10 points is too short. Fig. [2] (B), shows the 
range of diffusion coefficients, for which the criteria perform well. Again, the BIG outperforms the AIG 
and AIGc in accuracy, as well as range. Under the specified conditions, the BIG reliably attributed the 
correct model to the simulated trajectories for a D between 0.01 firn^/s and 5 /s. This is the range 

of diffusion coefficients that is typically observed for membrane molecules. The BIG also outperforms 
the AIG and AIGc with respect to the possible acquisition times. When the input D is 0.1 [irv? j the 
BIG is reliable for acquisition times between 10 and 200 ms. These data indicate that the BIG is the 
better indicator for free Brownian motion, when compared to the AIG and AIGc. It should be noted that 
this evaluation is solely for the model of the trajectory and not the inferred parameters, which might be 
subject to a bias, as discussed in reference [12]. 


Confined motion in a harmonic potential {V oc r^) 

This section studies the performance of the three information theory criteria for numerical Brownian tra¬ 
jectories that are confined in a harmonic-spring potential (V = l/2fcr^), where k is the spring constant. 
Again, we simulated 300 trajectories under each condition and recorded the normalized histogram of 
decisions, which are made for each trajectory by determining which model yielded the minimal value for 
a criterion. The performance of the BIG, AIG and AIGc with respect to the trajectory length is shown 
in figure El (A). The BIG slightly outperforms both the AIG and AIGc for most trajectory lengths and 
correctly attributes the right potential down to 50 trajectory points. The diffusion coefficient does not 
change the performance of the BIG over a large range of D (Fig. [3](B)). The AIG and AIGc only perform 
down to a D of 0.05 ^m^/s. Figure[3] (G) shows that the BIG outperforms both the AIG and AIGc and is 
correct over a larger range of acquisition times. The impact of the strength of the potential is investigated 
in Fig. 13] (D). The BIG is the better indicator and the performance remains constant, except for low 
k. This investigation shows that the BIG performs overall better in determining the preferred model for 
Brownian trajectories that are confined by a spring potential. A histogram of decisions for parameters 
close to experimental conditions is given in Fig. S2. 


Confined motion in an anharmonic potential {V oc r^) 

The last investigated case are numerical Brownian trajectories that are confined by a 4th order (anhar¬ 
monic) potential V = ar'^. As before, we simulated 300 trajectories under each condition and recorded 
the normalized histogram of decisions made for each trajectory. The performance of the BIG, AIG and 
AIGc with respect to the trajectory length is shown in figure ID (A). In stark contrast to the previous two 
cases, the BIG does not perform well and fails to attribute the correct potential. However, both the AIG 
and AIGc do find the correct potential down to a trajectory length of 400 points. As shown in Fig. |4| 
(B), the AIG and AIGc can determine the correct model up to a diffusion coefficient of 0.1 while 

the BIG cannot. The AIG and AIGc can determine the correct model for acquisition times below 200 ms 
when D is 0.1/rm^/s (Fig. |4|(G)). Figure|4|(D) shows that, similarly to the spring-potential case above, 
the strength of the confining potential does not significantly influence the ability to choose the correct 
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potential for the AIC and AICc, except at very low a values. These data suggest that the BIG cannot 
be used to determine that a trajectory is confined by a 4th order potential. However, the AIC and AICc 
are good tools to do so. 


Building the decision tree 

Because the BIC cannot always determine the correct model, as shown by the results above, model se¬ 
lection cannot be performed in a unique step with one criterion. As we can see, the BIC can distinguish 
between the free Brownian case and the harmonic potential. However, AIC or AICc are able to distin¬ 
guish between the harmonic potential and the anharmonic potential over a wide parameter range. This 
information points towards building a decision tree with nested models that first distinguishes between 
free motion and confinement and then, in a second step, can determine the shape of the potential. Using 
the idea of the decision tree, we mapped a greater parameter space than previously explored to determine 
the structure and limits of the final decision tree. Simulations mapping the trajectory length N and the 
diffusion coefficient D space, are shown in figure [S] Here, the normalized performance of the three criteria 
for 300 numerical trajectories per condition is shown by a color code after a threshold of 0.5 has been 
applied. The threshold is used because we require that our method should determine the correct model 
over half the time. The three models are shown vertically in columns and the three criteria BIC, AIC 
and AICc are shown for the three models in rows, as indicated. The bright areas show that the model 
could be attributed correctly and black squares indicate that the correct model could not be found, i.e. 
more than 50% of trajectories were falsely classified. 

As previously suspected, the BIC can correctly distinguish between free Brownian motion and the 
harmonic potential (red box), but fails to determine the anharmonic 4th order potentials. The AIC and 
AICc on the other hand, can distinguish between the different potentials (blue box), but cannot be used 
to determine free Brownian motion. Similar results are obtained by looking at the other parameters, such 
as acquisition time tacq and N (Supporting Figure S3). The potential strength also does not change the 
fact that only the AIC and AICc can be used to distinguish the different potentials (Supporting Figure S4). 

The key in building the decision tree was to realize that the anharmonic 4th order potential was 
falsely determined to be a spring potential by the BIC, as shown in in Supporting figure S5. Thus, we 
used the BIC only to determine if a trajectory was undergoing free Brownian motion or if it was confined 
in a potential (red arm in Fig. [T] (bottom)). The simulations showed that this was possible for trajectory 
lengths down to 20 points for most D and acquisition times between 10 and 200 ms. Once, a trajectory 
has been determined to be confined in a potential, the AIC or AICc can be used to classify the potential 
to be spring-like or 4th order. For this classification, the trajectory length has to exceed 500 points, the 
acquisition time has to be below 200 ms and the potential strength of the 4th order potential should be 
in the range of 0.1 to IpN/^m^. These conditions are the consequence of the information accessible in 
the trajectory as demonstrated in [36]. If the tracked biomolecule moves too fast with respect to the 
acquisition time or if the force due to the potential is far greater than the thermal noise, it will become 
increasingly difficult to resolve these effects. Overall, the AIC gives a slightly larger window in which it 
determines the correct potential. Thus, we use the AIC instead of the AICc in our final decision tree. A 
source code that calculates the BIC, AIC and AICc for a given trajectory for the three models is provided 
in the supporting information (Model distinction in trajectory.c) (Source Code Sll). 
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Application to experimental Clostridium Perfingens e-toxin (CPeT) receptor 
trajectories 

We applied the derived decision tree to experimental Clostridium Perfingens e-toxin receptor trajectories. 
This pore-forming toxin binds to a receptor in the cell membrane and undergoes confined motion in lipid 
rafts [ini[TIl[37]. The tracked toxin monomers exploit these confinement zones to locally increase their 
concentration and initiate further biological functions. In previous work, we have analyzed the trajecto¬ 
ries by Bayesian inference and modeled them by a 2nd or 4th order potential. We then used a comparison 
of the magnitude of terms in the two polynomials to conclude that a 2nd order description of the confining 
potential is sufficient because the 4th order terms are small m- With our here improved decision tree 
technique, we analyzed a total of 60 trajectories, which we have cut to only contain 500 points. The 
first step of the decision tree that uses the BIC determined that 59 trajectories are confined, while only 
1 trajectory was attributed to free Brownian motion (Fig. |6] insert). For the confined trajectories, the 
next step in the decision tree uses the AIC to determine the shape of the potential. The blue histogram 
in figure [6] shows that the majority of the experimental trajectories (54) were found to undergo confined 
motion in a spring potential, while only 5 were found to be confined by a 4th order potential. 

The results of the decision tree agree with the previous findings that the CPeT receptor undergoes 
confined motion in a spring-like potential However, the present method is much more flexible and 
fast in classifying the shape of the potential than the previously used method which consisted in fitting 
the confining potential with polynomials of growing order and quantifying the evolution of the likelihood 
along with the error on the inferred parameters. 


Preferred model selection with temporal resolution for multi-mode trajectories 

Single-molecule trajectories often change their mode of motion. Here, we applied the decision tree method 
to classify the mode of motion along a single-molecule trajectory to determine the current mode of motion 
during the observed trajectory. 

To this end, we selected the preferred model via the information criteria along a trajectory us¬ 
ing a window of variable size. We chose simulation parameters, similar to experimental conditions 
{Dinput = 0.1 tacq = 50 ms , Br = 30nm, k = 0.3pN/fim, N = SOOframes). Consulting Fig. 

O we chose a window size of 51 frames that slides along the trajectory. As an example, we studied the 
transition from confined motion in a harmonic spring potential to free Brownian motion half way through 
the numerical trajectory (Fig. [21(A)). 

Figure|2l(B) shows the determined mode of motion for the numerical trajectory shown in|7|(A). The 
Bayesian decision tree can correctly identify the confined part (red) of the trajectory and the free Brown¬ 
ian motion part (blue). A low-pass filter that does not allow switching of modes unless three consecutive 
frame positions of the window yield a mode change, gives a very reliable result (Fig. [21(C)). 

To study the performance in greater detail, we evaluated the mode of motion of 50 numerical tra¬ 
jectories. Figure [71 (D) shows the number of decisions, using the BIC criterion (shown on the x-axis as 
a normalized histogram), as the central frame of the 51-frame time window slides along the trajectories 
(shown on the y-axis). When a frame was deemed free Brownian, it is represented in blue (left side), when 
it was deemed confined in a spring potential it is represented in red (right side). The red and blue counts 
add up to unity for each central frame. The input mode of motion is shown by the black dotted line and 
switches from confined to free at frame 250. On average, the method can correctly identify the mode of 
motion. The BIC is more sensitive towards free Brownian motion, i.e. when some part of the 51-frame 


long window of observation sees free Brownian motion, the decision is deemed Brownian overall. This 
explains why the method finds the frame at which the trajectory switches from confined to Brownian at 
frame number 228 ± 9 for the ensemble of 50 trajectories. The ratio of confined to free Brownian mo¬ 
tion decisions agrees with the ratio previously determined by the simulations for the decision tree (Fig. [5]). 


Comparison of the Bayesian decision tree method to a residence time based 
method 

To further evaluate the precision of the here presented decision tree method we compare it to a commonly 
used residence time based method. The residence time based method detects temporal confinement by 
identifying periods in which the receptor remains in a membrane region for durations longer than a free 
Brownian diffusing particle would stay in an equally sized region [28) . We generate 5 trajectories for two 
different spring constants of 1200 frames length (60 s) that have three distinct temporal confinement zones 
of 200 frames duration (10 s) and test whether the two methods can detect the real confinement zones 
(Fig. [5](A.)) and the level of falsely detected confinement zones, which are not present in the input (Fig. |S] 
(B)). The results show that both methods can identify confinement zones but only the Bayesian decision 
tree method can detect all of the 15 input regions. The real strength of the Bayesian decision tree method 
lies in the low number of false positives, when compared to the residence time based method. The deci¬ 
sion tree method only finds one non-existing confinement region, while the residence time method hnds 

5 and 6 false zones for a spring constant of 0.3 and 0.6pN/^m, respectively. Figure |8] (C-E) and (F-H) 
show the performance of the two methods along two numeric sample trajectories for two different confin¬ 
ing spring constants of 0.3 and 0.6pN/pm, respectively. The residence time based method in Fig. |8] (E 

6 E) fails to detect one input zone and falsely declare a free Brownian section of the trajectory as confined. 


Model selection along experimental Clostridinm Perfingens e-toxin (CPeT) 
receptor trajectories 

We applied the preferred model selection with temporal resolution to an experimental CPeT receptor 
trajectory that changes its mode of motion. As previously shown, the confining membrane domain of the 
CPeT receptor can be destabilized by oxidizing cholesterol in the cell membrane [7]. 

The Bayesian decision tree method determines a change of motion from confined to free Brownian 
motion along the recorded experimental data (Fig. [7] (E)). Here, the trajectory is undergoing conhned 
motion (red) at the beginning of the incubation with the enzyme cholesterol oxidase. An additional 
trajectory is shown in the Supporting Figure S6. As the enzyme gradually oxidizes more cholesterol in 
the membrane, the receptor switches to free Brownian motion (blue). This result agrees with our pre¬ 
vious work, which detected a decrease in confining potential along the CPeT receptor trajectory during 
incubation with cholesterol oxidase. 


Discussion 

We introduced a decision-tree based method that uses information criteria to determine the mode of 
motion of a single-molecule trajectory. The method is based on the combined use of the Bayesian in¬ 
formation criterion (BIC), the Aikaike information criterion (AIC), and a modified version of the AIC 
(AICc). These criteria are used to determine which model best describes a specific data set. The models 
that we discussed in this work are free Brownian motion, confined motion in a harmonic potential and 
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confined motion in a 4th order polynomial potential. All of these models can be associated to various 
identifiable structural characteristics. A spring-like potential can indicate actin tethering or hydrophobic 
interactions and an anharmonic 4th order potential can indicate more local conhnement by the picket- 
fence model, for example. The information criteria allows association of experimental data to these 
models. Thus, this method can provide quantitative information on how much better a certain model 
describes a trajectory with respect to other competing models. Of course, it is possible to extend this 
method to include more models of motion. Furthermore, more complex environments can be numerically 
investigated and associated to more simplified models, such as the ones discussed here to classify trajec¬ 
tories. 

In order to build the decision tree, numerical trajectories were generated under various conditions 
matching previous experimental data in the field of single-molecule tracking. These trajectories were 
either undergoing free Brownian motion or confined motion in a potential. For each trajectory, we calcu¬ 
late the BIC, AIC and AICc using the posteriori probabilities corresponding to each investigated model. 
Then the algorithm chose the preferred model by evaluating which model has the smallest information 
criterion value. The key feature was that this method supplied a reliable method based on simple criteria 
that can automatically classify the mode of motion. 

Surprisingly, the most accurate method in choosing the preferred model is not a simple decision mak¬ 
ing step, but a decision tree where the two confined models are nested. A first decision using the BIC 
determines if the trajectory is undergoing free Brownian motion or if it is confined. Here, the minimal 
trajectory length for meaningful classification is 50 frames under most conditions (D; 0.01 — 10 ^m^/s, 
k< O.lpN/ ^m, and tacq- 10 — 100ms). In a second step, the nature of the confining potential is inves¬ 
tigated. The AIC is used to distinguish between the harmonic potential and the anharmonic potential. 
This distinction can be made with trajectories that have more than 500 frames for diffusion coefficients 
below 0.1 pm'^/s, spring constants of 0.02 — lOpN/fim, and a in the range of 0.1 — 1 pN/pm?. Experi¬ 
mentally determined single molecule diffusion coefficients range from 0.004 to 2.2 pm?/s [38l|39]. Thus, 
the presented method yield reliable results for two out of the three magnitudes spanned by results from 
literature. Experimental spring constants lie in the range of 0.3 — 10 pN/pm [TnfCTfT^ldO] . The method 
can provide an accurate model distinction for the entire range of experimental spring constant values in 
the current literature. The need for at least 500 frames is the largest limitation of the presented technique 
and will provide a challenge for groups that use organic fluorophores for tracking. However, quantum 
dots or fluorescent nanoparticles provide an alternative label that is more photostable. It should be 
noted that we did not discuss the accuracy of the inferred parameters, such as the diffusion coefficient 
in this text and we only focus on the ability to attribute the correct model. They might be subject to 
a bias, as discussed in reference m- We then apply the decision tree to experimental CPeT receptor 
trajectories and show that the method can reliably confirm previous findings about the nature of the 
confining potential of these receptors. 

The method can also be adapted to determine changes in the mode of motion along a single-molecule 
trajectory. To this end, we evaluated the mode of motion using a window of 51 frames that slides along 
the trajectory in time. Simulations show that the method can reliably classify confinement regions on 
numerical trajectories and that it outperforms the commonly used residence time based method. Thus, 
this technique supplies a rigorous and reliable tool to automatically segment and classify SMT data. 
Einally, this method can easily confirm previous findings about the loss of confinement of CPeT receptor 
trajectories due to lipid raft disaggregation. In the previous analysis, the mean diffusion coefficient in¬ 
creased from 0.063±0.0I pm?/s to 0.18±0.02 pm?/s, while the spring constant of the confining potential 
decreased from 237 ± 44:kBT.pm~'^ to 35.4 ± 7.7 kBT.pm~‘^ [7]. In the previous work, we have analyzed 
the change in motion of many receptors and concluded that their confinement reduces but does not be- 
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come entirely free Brownian motion [7]. Here, we have selected the most extreme trajectories from the 
dataset, which do switch to free Brownian motion for the sake of showing a change in the mode of motion. 

In conclusion, we have successfully built a method that can reliably distinguish between different 
modes of motion over a wide parameter range. Furthermore, it is possible to quantify how reliable the 
method is for each group of input parameters. This method can be extended to include further models 
of motion, for example phenomena like active transport and hopping. 

This technique adds another dimension to the discussion of SMT data. Currently, most of the discus¬ 
sions are focused on determining the mean value of quantities, such as diffusion coefficients and confining 
domain sizes. Such measurements, however, are not easily compared over the vast range of experiments 
and tracked biomolecule species. Additionally to parameters, such as diffusion coefficient and domain 
sizes, this method can be used to gain information about the mode of motion and changes in the mode 
of motion. The mode of motion of a receptor might hold more biologically relevant information than the 
diffusion coefficient or domain size and is perhaps a better tool to classify and compare different SMT 
experiments. 


Materials and Methods 

Generating nnmerical trajectories 

In this work, we generate numerical trajectories that undergo free Brownian motion or Brownian motion 
in a 2nd and 4th order potential (H oc H oc r^). To simulate two-dimensional Brownian motion, the 
length of each step was taken from a Gaussian distribution with a standard deviation of V41?At, where the 
input diffusion coefficient D and the acquisition time At. The angle of each step is randomly distributed 
over [0,27r]. Each particle takes 1000 substeps during each At. The substeps are not averaged. If the 
trajectory is confined by a potential, the displacement due to the force generated by the confining potential 
is added to each substep. The confining potentials used, as demonstration, in this work are the 2nd order 
spring potential {V{x,y) = Xj^kxx^ + Xj^kyy^ with k = \/kx + ky), and the 4th order potential 
{V{x,y) = axx'^ + ayy^ with a = yja\ -I- cty). Yet, we emphasize that this approach is not limited 
to spring potentials nor to polynomial potentials. For all numerical trajectories, static positioning noise 
Br = 30 nm was added to the trajectory by an additional displacement taken from a Gaussian distribution 
with standard deviation 2Br with an angle randomly distributed over [0, 27r]. This Gaussian noise models 
all sources of noise, i.e. Poissonian photon shot noise due to signal and fluorescence background, detector 
noise, pixelization effects, and error of the localization algorithm using a Gaussian representation. The 
source code for the trajectory generation is given in the supporting information: Free Brownian motion 
(GenerateBrownianTraj.c) (Source Code S8), Brownian motion confined in a 2nd order spring potential 
(GenerateBrownianTrajin2ndOrderPot.c) (Source Code S9), and Brownian motion confined in a 4th order 
potential (GenerateBrownianTrajin4thOrderPot.c) (Source Code SIO). 

Bayesian Inference 

We developed the Bayesian inference approach to treat single-molecule trajectories in previous works 
[Mill], but we include some information here to make this article self-contained. 

The single-molecule motion is modeled by the overdamped Langevin equation: 
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dr 

dt 


Vy(r) 

7(r) 


V^)m, 


(4) 


with 7 (r) the spatially varying friction coefficient, D{r) the spatially varying diffusion coefficient, V (r) 
the potential acting on the biomolecule and ^(t) the rapidly varying zero-average Gaussian noise. The 
fluctuation-dissipation theorem gives D{r) = kBT/^{r) [JT]. In this work we don’t address the question 
of diffusion heterogeneities by setting D{r) = D. 

The associated Fokker-Planck equation, which governs the evolution of the transition probability 
over time, has no general solution for an arbitrary potential and a spatially varying diffusion coefficient. 
We therefore divide the confinement domain into subdomains using a mesh grid and the points of the 
trajectory are attributed to their respective grid subdomains. Within each subdomain, we consider that 
the potential gradient is constant. Note, that this mesh is not used, when the suspected model is a purely 
Brownian trajectory. This assumption enables us to solve the associated Fokker-Planck equation to Eq. 
m for a constant Fij and D per subdomain (i, j), where the forces in adjacent subdomains is free to vary. 
The assumption leads to the expression of the transition probability. 


(r2-ri-Fij(t2-tl)/7»j)^ 
g 4(r)+CT^/(t2-ti))(t2 - *1 ) 
PcouS^ned{{r2M\r^lM)\P^J.D) = + 


(5) 


with a the amplitude of the positioning noise 2Br. This expression is the probability of going from 
one space-time coordinate (ri, ti) to the next (r 2 , t 2 ) for a diffusivity D and a force with a positioning 
noise tr. The transition probability for the free Brownian motion can be obtained from Eq. [5] by setting 
Fij = 0 and gives: 


Furownian {{r2,t2\ri,ti)\,D) 


(rg-ri)-^ _ 

g 4(l? + o-^/(t2-ti))(t2 - *l) 


47r (D -I- 0-2/ {t 2 - ti)) {t 2 - h) 


( 6 ) 


The overall probability of observing a certain trajectory for a given set of variables is then computed 
by multiplying all the displacement probabilities between all individual points in the dataset, assuming 
that the motion of the molecule is a Markov process. 

Now that the likelihood is known, we may apply Bayes’ rule: 


P{Q\T) 


P{T\Q)Po{Q) 

PoiT) 


(7) 


where P {Q\T) is the posterior or a posteriori probability of the parameters, i. e. the probability 
that the parameters Q take on a specific value given the recording of the trajectory T. P {T\Q) is the 
likelihood of the trajectory, i. e. the probability of recording the trajectory T given a specific value Q 
of the parameters, Pq (Q) the prior probability of the parameters and Pq (T) is a normalization constant 
called the evidence of the model [42l|43]. Without prior knowledge on the parameters, the prior proba¬ 
bility Pq {Q) is supposed to be constant over a broad range of possible values. 
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Calculation of the posteriori probability and inferred value 

The inferred values are obtained from the posteriori probability distribution. All variables are initialized 
at zero. The prior probability Pq (Q) is taken to be unity. A quasi-Newtonian optimization using the 
Broyden-Fletcher-Goldfarb-Shanno (BFGS) algorithm [44] finds the maximum of the a posteriori distri¬ 
bution (MAP) P {T\D, Fij) on a 10 X 10 mesh of subdomains. In the case of inferring the 2nd and 4th 
order potential, the Fy are no longer independent but are governed by 2nd and 4th order polynomials, 
respectively. The number of free parameters for the pure Brownian case is thus 1, for the 2nd order 
potential it is 6 , for the 4th order it is 15. The inference algorithms were programmed in C language and 
executed on a local PG (dual-core 3 GHz, 2 GB RAM) or Amazon web services (AWS) on cl.medium 
instances (high-CPU computing instance). 


Calculation of Information criteria along the trajectory 

We determine the mode of motion along a trajectory that changes its mode of motion by calculating the 
BIG, AIG and AIGc for a window of 51 frames, which slides along the trajectory in time. The information 
criteria are calculated for each central frame i of the window, by evaluating the posteriori probability 
and subsequently the information criteria for the trajectory portion between frame z — 25 to frame i + 25. 
We only distinguish between free Brownian motion and confinement in a spring potential. To test the 
algorithm, we generate 50 numerical trajectories with parameters: Umput = 0.1 tacq = 50 ms, 
Br = 30nm, k = 0.3pN/p,m, N = 500frames. Fig. jTjshows that the decision tree can distinguish these 
two modes of motion under these conditions. 

Clostridium Perfingens e-toxin (CPeT) receptor tracking 

Yo, 6 Euo. 4 V 04 nanoparticles were coupled to toxins as described in [45]. In brief, we coupled APTES- 
coated europium-doped nanoparticles to e-toxin produced by C. perfringens bacteria (CpeT) via the 
amine-reactive cross-linker bis (sulfosuccinimidyl) suberate (BS 3 ), as described in [TT]. A BCA test used 
to determine the amount of toxin after the coupling process, showed a toxin:nanoparticle coupling ratio 
of 3:1. 

Tracking experiments were performed with a wide-held inverted microscope (Zeiss Axiovert 100) 
equipped with a 63x, NA = 1.4 oil immersion objective and a EM-CCD (Roper Scientihc QuantEM:512SG) 
NPs were excited with an Ar+-ion laser using the 465.8 nm line. A 500DRLP beam splitter (Omega) di¬ 
rects the beam towards the sample. Emission was collected through the beam splitter and a 617/8M hlter 
(Ghroma). Gonfluent cells on coverslips were incubated with 0.04nM NP-labeled proto-toxin (CPepT) 
for 20 minutes at 37 °G. The concentration was low to avoid oligomerization and observe single NPs (« 10 
per cell). The sample was then rinsed three times with observation medium (HBSS -I- lOmM HEPES 
buffer, 1% FGS) to remove non-bound toxins and nanoparticles. We recorded images at a frame rate of 
about 20 Hz (exposure time: 50 ms; readout time: 1.3 ms) and an excitation intensity of 0.25A:lF/cm^ 
at 37 °C in observation medium. We have verihed that the nanoparticles do not perturb the motion of 
the CPeT receptor through tests with substitute organic fluorophore labeling and the use antibodies m- 

The toxin receptor position in each frame was determined from a Gaussian ht to the diffraction pattern 
of the nanoparticles with a home-made Matlab V8.2 (Mathworks, Natrick MA) algorithm. 

MDCK Cell Culture 

Madin-Darby canine kidney (MDGK) cells were cultured in (DMEM, 10% fetal calf serum (FGS), 1% 
penicillin-streptomycin) culture medium at 37 °G. For tracking experiments, cells were trypsinated two 
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days before and transferred onto acid-bath treated glass microscope coverslips and grown until confluent. 
The medium was replaced by an observation medium (HBSS -I- lOmM HEPES buffer, 1% FCS, pH 7.4) 
just before the tracking experiment, which lasted less than 1.5 h. 

Cell treatment with Cholesterol oxidase 

Where mentioned, we incubated cells with 20 U/mL cholesterol oxidase (Calbiochem) in HBSS-|-10 toM 
HEPES for 30 minutes. A cholesterol quantification kit (Invitrogen) was used to determine successful 
cholesterol digestion on lyzed cells before and after incubation. The data has been previously used to 
examine membrane-protein interactions [7]. 

Residence time method for the detection of temporary lateral confinement 

We followed the work by Simson et al. [28]. The probability 4/ that a protein with diffusion coefficient 
H, will stay in a circular region with radius R for a time period t^d is calculated by: 


log'S = 0.2048 - 2.5117 Dt^d/R^ 


( 8 ) 


We compute this probability along the trajectory for a window of 51 frames (t^d = 2.55 s), with 
R being the maximal displacement from the center position of the 51 frame window and known input 
diffusion coefficient. To accentuate the region of nonrandom behavior, we calculate the probability level 
L according to: 

L=( I"*-"!' (9) 

\ 0 for 4' > 0.1, ^ ^ 

L is filtered in magnitude with a cutoff Lc = 4 and temporal tc = 2.5 s. A plot of L along a numerical 
sample trajectory is shown in Fig. S7. To qualify as a confinement zone, L has to be larger than Lc for a 
time greater than tc- The parameters were optimized to detect the confinement zones without generating 
many false positives. 
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Figure 1. Bayesian Decision Tree for the Classification of Single-Molecule Trajectories. 

Biomolecules undergo a variety of different modes of motion in the cell membrane, which are often 
difficult to distinguish. We show Brownian motion, and confined motion in a 2nd and 4th order 
potential as examples for receptors that might reside in lipid rafts or move according to the picket-fence 
model. Using a Bayesian inference and a decision tree, which can be developed through simulations 
with known modes of motion, it is possible to easily classify modes of motion of molecules in the cell 
membrane. Adapted decision criteria, such as the Bayesian information criterion (BIC) or the Akaike 
information criterion (AIK) can be computed from the maximum a posteriori distribution (MAP) and 
used to make decisions on the single-trajectory level. The decision tree that was derived for this work is 
shown in the bottom of the figure. We first use the BIC (red) to determine if a potential confines the 
biomolecule and then classify the type of potential using the AIC (blue). 



















































18 



Number of points (N) 




Figure 2. Information criteria for simnlated free Brownian trajectories. To determine the 
performance of the decision criteria, we calculated the BIG (black), AIC (blue) and AICc (red) for 
simulated Brownian trajectories under various conditions. (A) Percentage of correct decisions 
(300 trajectories per point) versus the length of the trajectory (Parameters: Dinput = 0.1 
tacq = 50 ms, Br = 30 nm). The BIG outperforms the AIG and AIGc. (B) Percentage of correct 
decisions versus the input diffusion coefficient (Parameters: N= 500points, tacq = 50ms, Br = 30nm). 
The BIG outperforms the AIG and AIGc and works down to a diffusion coefficient of 0.01 /s. (G) 

Percentage of correct decisions versus acquisition time (Parameters: Dinput = 0.1 /s, 

N = 500points, Br = 30nm). The BIG outperforms the AIG and AIGc and works for acquisition times 
between 10 ms and 1000 ms. 
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Figure 3. Information criteria for simnlated Brownian trajectories confined in a 
spring-like potential (V = l/2kr^). To determine the performance of the decision criteria, we 
calculated the BIC (black), AIC (blue) and AICc (red) for trajectories under various conditions. (A) 
Percentage of correct decisions (300 trajectories per point) versus the length of the trajectory 
(Parameters: Dinput = 0.1 tacq = 50ms, = 30nm, k = O.OpN/fim). The BIC outperforms 

the AIC and AICc. (B) Percentage of correct decisions versus the input diffusion coefficient 
(Parameters: N= 500points, tacq = 50ms, = 30nm, k = 0.3pN/pm). The BIC outperforms the 

AIC and AICc and works down to a diffusion coefficient of O.OI pm?/s. (C) Percentage of correct 
decisions versus acquisition time (Parameters: Di„pat = 0.1 pm?/s, N = 500 points, B,. = 30 nm, 
k = 0.3pN/pm). The BIC outperforms the AIC and AICc and works for acquisition times between 
1 ms and 1000 ms. (D) Percentage of correct decisions versus input spring constant (Parameters: 
Dinput = 0.1 pm^/s, N = 500points, Br = 30nm). The BIC outperforms the AIC and AICc. 
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Figure 4. Information criteria for simulated Brownian trajectories confined in a 4th order 
potential (V = ar^). To determine the performance of the decision criteria, we calculated the BIC 
(black), AIC (blue) and AICc (red) for trajectories under various conditions. (A) Percentage of correct 
decisions (300 trajectories per point) versus the length of the trajectory (Parameters: 

Dinput = js, tacq = 50 ms, Br = 30nm, a = 0.5pN/. The AIC slightly outperforms AICc 

for shorter trajectories. The BIC classifies the trajectory erroneously as confined in a 2nd order 
potential. (B) Percentage of correct decisions versus the input diffusion coefficient (Parameters: 

N = 500points, tacq = 50ms, Br = 30nm, a = 0.5pN/pm^). The AICc slightly outperforms AIC for 
small diffusion coefficients. The BIC is wrong and classifies the trajectory erroneously as confined in a 
2nd order potential. The AIC and AICc cease to work for diffusion coefficients higher than 0.2 pm?/s. 
(C) Percentage of correct decisions versus acquisition time (Parameters: Dmput = 0.1 pm'^/s, 

N = 500points, Br = 30nm, a = 0.5pN/pm?). The AICc slightly outperforms AIC for short 
acquisition times. The BIC does not work, i.e. correct attribution below 50%. The AIC and AICc cease 
to work for acquisition times longer than 100 ms. (D) Percentage of correct decisions versus input 
potential strength (Parameters: Dinput = 0.1/rm^/s, N = 500 points, Br = 30 nm). The AIC and AICc 
work equally well and outperform the BIC. Very weak potentials cannot be detected. 
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Figure 5. Building the decision tree using information criteria from simulated trajectories. 

The 2D plots show the heat map of the percentage of correct decisions out of 300 simulated trajectories 
per square for the BIC (first row), AIC (middle row), and AICc (bottom row). The input trajectories 
were free Brownian (left column). Brownian confined in a 2nd order spring potential V (x (middle 
column), and Brownian confined in a 4th order potential V oc. r'^ (right column). The heat map has a 
threshold of 0.5, which means that only cases where the information criterion works correctly more than 
half of the time are non-black as indicated by the color scale. The BIC is the better criterion to 
determine if a trajectory is undergoing purely Brownian motion or if is confined by a potential (red box 
& red arm in decision tree in Fig. [T]). The BIC is not suited to distinguish between a 2nd and 4th order 
potential. Here, the AIC and AICc provide a solution (blue box & blue arm in decision tree in Fig. [1}. 
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Figure 6. Classification of experimental Clostridium Perfingens e-toxin (CPeT) receptor 
trajectories. We apply the decision tree in Fig. [I]to 60 experimental trajectories with a length of 500 
frames. First we use the BIC to determine that the trajectories are confined (red insert). 59 trajectories 
were found to be confined while one trajectory was attributed to free Brownian motion. The AICc 
shows that the CPeT receptors are confined in a 2nd order potential F oc which is in agreement with 
previous results m- 
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Figure 7. Classification of the mode of motion along a trajectory. (A) We apply the first part 
of the decision tree in Fig. [T]to single numerical trajectories, which switch from being confined by a 
spring potential (red) to free Brownian motion (blue). (Parameters: Dinput = 0.1 tacq = 50ms, 
Br = 30 nm, k = 0.3pN/^m). (B) shows the result of using the BIC criterion along the numerical 
trajectory shown in (A). We use a window of 51 frames that slides along the trajectory, and a 
classification is made for each central frame of the window. The method can correctly identify 
confinement (red). (C) Low-pass filtering the classifications gives a very robust method to determine 
the mode of motion of a trajectory that changes. (D) shows the performance of the BIC along the 500 
frames of 50 numeric trajectories. The input mode is shown by the black dotted line, which is at first 
confined and switches to free Brownian motion at frame 250. The blue histogram shows the number of 
free Brownian classifications at a certain central frame. The red histogram shows the number of 
spring-potential confined classifications. (E) Classification along a CPeT receptor trajectory, while 
confinement is reduced due to a modification of the cell membrane. 
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Figure 8. Comparison to the residence time method. We apply the first part of the decision tree 
in Fig. [T]to single numerical trajectories, which cycle between confinement by a spring potential and 
free Brownian motion. (Parameters: Dinput = 0.1 tacq = 50 ms, = 30 nm, k = 0.3 & 

0.6pN/fim). Receptors are confined three times for 200 frames (10s). (A) Histogram of the correctly 
found confinement zones out of 15 zones for the Bayesian decision tree method (blue) and the residence 
time method (red) for two spring constants. We use a window of 51 frames that slides along the 
trajectory, and a classihcation is made for each central frame of the window. (B) shows a histogram of 
non-existing found confinement zones for the Bayesian decision tree method (blue) and the residence 
time method (red) for two spring constants. (C & F) One of the five input trajectories with three 
confining zones (red) with a spring constant of A: = O.SpN/pm and k = O.OpN/pm, respectively. (D & 
G) Result using the Bayesian decision tree method with free brownian motion in blue and confined 
motion in red. (E & H) Result using the residence time method with free brownian motion in blue and 
confined motion in red. 
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Supporting Information 



Information Criterion Value 



- 1 - 

Brownian 2nd Order 4th Order 


Decision 


Figure SI. Averaged versus single-trajectory decision making. (A) Calculating and averaging 
the information theory criteria, such as the here displayed AIC, for a distribution of 100 numerical 
trajectories (Parameters: D = 0.1 N = 500 points, tacq = 50 ms, Br = 30 nm, a = 0.5pN/ 

shows that it is possible to distinguish between free Brownian motion and confined motion, but it is 
impossible to determine the nature of the confining potential from the averaged data. (B) However, 
making decisions based on the criteria for each individual trajectory can lead to a histogram that 
correctly identifies the input potential. 



Model 


Figure S2. Histogram of trajectory classifications out of 300 numerical input trajectories 
that resemble experimental trajectories with confinement in a spring-like potential. 

(Parameters: D = 0.1 pm^/s, N = 500points, tacq = 50ms, Br = 30nm, k = O.SpN/pm). Decisions 
based on the BIG are shown in black, decisions based on the AIC and AICc are shown in red and blue, 
respectively. 
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Figure S3. Building the decision tree using information criteria from simulated 
trajectories. The 2D plots show the heat map of the percentage of correct decisions out of 300 
simulated trajectories per square for the BIG (first row), AIC (middle row), and AICc (bottom row). 
The input trajectories were Brownian (left column), confined in a spring potential V = 1/2 kr^ (middle 
column), and confined in a 4th order potential V = (right column). The heat map has a threshold 
of 0.5, which means that only cases where the information criterion works correctly more than half of 
the time are non-black as indicated by the color scale. The BIG is the better criterion to determine if a 
trajectory is undergoing purely Brownian motion or if is confined by a potential (red box & red arm in 
decision tree in Fig. 1). The BIG is not suited to distinguish between a 2nd and 4th order potential. 
Here, the AIG and AIGc provide a solution (blue box & blue arm in decision tree in Fig. 1). 
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Figure S4. Building the decision tree using information criteria from simulated 
trajectories. The 2D plots show the heat map of the percentage of correct decisions out of 300 
simulated trajectories per square for the BIG (first row), AIC (middle row), and AICc (bottom row). 
The input trajectories were Brownian walkers, confined in a spring potential F = 1/2 kr^ (left column 
A-C), and confined in a 4th order potential V = ar^ (right column D-F). The heat map has a threshold 
of 0.5, which means that only cases where the information criterion works correctly more than half of 
the time are non-black as indicated by the color scale. The AIC and AICc are the only effective 
indicator that can distinguish between these two potential types. However, the strength of the potential 
does not have a large impact on the performance of these criteria. 
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Figure S5. The BIC erroneously classifies the 4th order potentials to be 2nd order spring 
potentials. The 2D plot shows the heat map of the percentage of trajectories classified to have a 2nd 
order spring potential out of 300 simulated 4th order {V = ar^) trajectories per square for the BIC. 

The heat map has a threshold of 0.5, which means that only cases where the information criterion 
falsely chooses 2nd order more than half of the time are non-black as indicated by the color scale. As 
mentioned earlier, the BIC cannot correctly attribute the 4th order potential, but finds a 2nd order 
spring-potential instead. Although this is clearly wrong, it can be exploited to build a two-step decision 
tree that can correctly distinguish all three cases using a mixture of BIC and AIC. 
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Figure S6. Inferring the mode of motion of single experimental CPT receptors during 
disaggregation of the confining domain. Two different trajectories are shown in A and B. We use 
the decision tree to determine the mode of motion on a 51 frame window that slides along the 
trajectory in time, while the cells are treated with cholesterol oxidase. The trajectory begins being 
confined by a spring-like potential (red). As the enzyme cholesterol oxidase oxidizes more cholesterol, 
the trajectory receptor becomes less confined and undergoes free Brownian motion (blue). 
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Figure S7. Residence time method for the detection of temporal lateral confinement. A 

single trajectory has three different confinement zones in a second order potential. (Parameters: 

Dinput = 0.1 tacq = 50 ms, Br = 30 nm, k = 0.3pN/fim). The three confined parts of the 

trajectory are shown in black where a non-zero value indicates confinement. Outside of the confined 
zones, the trajectory undergoes free Brownian motion. The probability level {L) is shown in red along 
the trajectory. L is filtered in magnitude with a cutoff Lc = ^ (dotted line) and temporal tc = 2.5 s. To 
qualify as a confinement zone, L has to be larger than L,. for a time greater than tc- The parameters 
were optimized to detect the confinement zones without generating many false positives. The threshold 
result displaying the confinement zones is shown in green. The method could correctly determine two 
out of three confining zones and found one false confinement zone (false-positive). 









































Source code 

Source codes available at: littp://journals.plos.org/plosone/article?id=10.1371/journal.pone.0082799, 
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Source Code S8: Code in C {GenerateBrownianTraj.c) to generate a trajectory undergoing free Brow¬ 
nian motion. 

Source Code S9: Code in C {GenerateBrownianTrajin2ndOrderPot.c) to generate a trajectory under¬ 
going free Brownian motion confined in a 2nd order spring potential. 

Source Code SIO: Code in C {GenerateBrownianTrajin4thOrderPot.c) to generate a trajectory un¬ 
dergoing free Brownian motion confined in a 4th order potential. 

Source Code Sll: Code in C {Model distinction in trajectory.c) to calculate the information criteria 
for a given trajectory. This algorithm requires the files: nrutil.c (Source code S12), nrutil.h (Source code 
S13) and simplex, c (Source code S14). 

Source Code S12: Code in C {nrutil.c) is required to run Model distinction in trajectory.c (Source 
Code Sll). 

Source Code S13: Code in C {nrutil.h) is required to run Model distinction in trajectory.c (Source 
Code Sll). 

Source Code S14: Code in C {simplex, c) is required to run Model distinction in trajectory.c (Source 
Code Sll). 



